Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tweak tuning database plot and comparison scripts #2883

Merged
merged 3 commits into from
Dec 3, 2024

Conversation

bernhardmgruber
Copy link
Contributor

@bernhardmgruber bernhardmgruber commented Nov 19, 2024

Here are some tweaks I made to @gevtushenko tuning database plot and comparison scripts. It fixes a few bugs and changes the behavior from plotting aggregated results across sub benchmarks (e.g. aggregating transform.add and transform.triad) to plotting each sub benchmark separately.

Copy link
Contributor

🟩 CI finished in 5h 06m: Pass: 100%/400 | Total: 8d 04h | Avg: 29m 31s | Max: 1h 25m | Hits: 11%/25894
  • 🟩 libcudacxx: Pass: 100%/118 | Total: 23h 14m | Avg: 11m 48s | Max: 49m 50s | Hits: 3%/9504

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total: 21h 58m | Avg: 11m 59s | Max: 49m 50s | Hits:   3%/9504  
      🟩 arm64              Pass: 100%/8   | Total:  1h 15m | Avg:  9m 25s | Max: 15m 58s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 42m | Avg:  6m 49s | Max: 44m 06s | Hits:   3%/2182  
      🟩 11.8               Pass: 100%/3   | Total: 29m 00s | Avg:  9m 40s | Max: 15m 55s
      🟩 12.5               Pass: 100%/4   | Total:  2h 41m | Avg: 40m 21s | Max: 49m 50s
      🟩 12.6               Pass: 100%/96  | Total: 18h 21m | Avg: 11m 28s | Max: 46m 14s | Hits:   3%/7322  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/12  | Total:  2h 24m | Avg: 12m 01s | Max: 18m 44s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 42m | Avg:  6m 49s | Max: 44m 06s | Hits:   3%/2182  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 29m 00s | Avg:  9m 40s | Max: 15m 55s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  2h 41m | Avg: 40m 21s | Max: 49m 50s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 15h 56m | Avg: 11m 23s | Max: 46m 14s | Hits:   3%/7322  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/12  | Total:  2h 24m | Avg: 12m 01s | Max: 18m 44s
      🟩 nvcc               Pass: 100%/106 | Total: 20h 49m | Avg: 11m 47s | Max: 49m 50s | Hits:   3%/9504  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 34m 21s | Avg:  5m 43s | Max: 11m 28s
      🟩 Clang10            Pass: 100%/3   | Total: 33m 26s | Avg: 11m 08s | Max: 14m 17s
      🟩 Clang11            Pass: 100%/4   | Total: 26m 36s | Avg:  6m 39s | Max: 13m 02s
      🟩 Clang12            Pass: 100%/4   | Total: 28m 51s | Avg:  7m 12s | Max: 16m 20s
      🟩 Clang13            Pass: 100%/4   | Total: 36m 42s | Avg:  9m 10s | Max: 13m 57s
      🟩 Clang14            Pass: 100%/4   | Total: 43m 46s | Avg: 10m 56s | Max: 14m 04s
      🟩 Clang15            Pass: 100%/4   | Total: 51m 24s | Avg: 12m 51s | Max: 19m 36s
      🟩 Clang16            Pass: 100%/4   | Total: 27m 52s | Avg:  6m 58s | Max: 14m 42s
      🟩 Clang17            Pass: 100%/4   | Total: 42m 03s | Avg: 10m 30s | Max: 19m 02s
      🟩 Clang18            Pass: 100%/18  | Total:  3h 43m | Avg: 12m 23s | Max: 18m 44s
      🟩 GCC6               Pass: 100%/2   | Total:  5m 16s | Avg:  2m 38s | Max:  2m 46s
      🟩 GCC7               Pass: 100%/6   | Total: 37m 23s | Avg:  6m 13s | Max: 13m 24s
      🟩 GCC8               Pass: 100%/6   | Total: 38m 41s | Avg:  6m 26s | Max: 14m 14s
      🟩 GCC9               Pass: 100%/6   | Total: 20m 17s | Avg:  3m 22s | Max:  4m 26s
      🟩 GCC10              Pass: 100%/4   | Total: 15m 52s | Avg:  3m 58s | Max:  4m 15s
      🟩 GCC11              Pass: 100%/7   | Total:  1h 15m | Avg: 10m 42s | Max: 15m 55s
      🟩 GCC12              Pass: 100%/4   | Total: 36m 49s | Avg:  9m 12s | Max: 14m 09s
      🟩 GCC13              Pass: 100%/17  | Total:  3h 14m | Avg: 11m 25s | Max: 42m 22s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 27m | Avg: 29m 14s | Max: 32m 44s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 44m 06s | Avg: 44m 06s | Max: 44m 06s | Hits:   3%/2182  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 23m | Avg: 41m 31s | Max: 42m 38s | Hits:   3%/4727  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 46m 14s | Avg: 46m 14s | Max: 46m 14s | Hits:   3%/2595  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  2h 41m | Avg: 40m 21s | Max: 49m 50s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/55  | Total:  9h 08m | Avg:  9m 57s | Max: 19m 36s
      🟩 GCC                Pass: 100%/52  | Total:  7h 03m | Avg:  8m 08s | Max: 42m 22s
      🟩 Intel              Pass: 100%/3   | Total:  1h 27m | Avg: 29m 14s | Max: 32m 44s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 53m | Avg: 43m 20s | Max: 46m 14s | Hits:   3%/9504  
      🟩 NVHPC              Pass: 100%/4   | Total:  2h 41m | Avg: 40m 21s | Max: 49m 50s
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total: 23h 14m | Avg: 11m 48s | Max: 49m 50s | Hits:   3%/9504  
    🟩 jobs
      🟩 Build              Pass: 100%/110 | Total: 20h 24m | Avg: 11m 08s | Max: 49m 50s | Hits:   3%/9504  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 34m | Avg: 23m 40s | Max: 29m 42s
      🟩 Test               Pass: 100%/3   | Total:  1h 12m | Avg: 24m 12s | Max: 42m 22s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 59s | Avg:  1m 59s | Max:  1m 59s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 29m 00s | Avg:  9m 40s | Max: 15m 55s
      🟩 90                 Pass: 100%/4   | Total: 42m 49s | Avg: 10m 42s | Max: 13m 33s
      🟩 90a                Pass: 100%/8   | Total: 56m 11s | Avg:  7m 01s | Max: 12m 08s
    🟩 std
      🟩 11                 Pass: 100%/32  | Total:  4h 17m | Avg:  8m 02s | Max: 30m 08s
      🟩 14                 Pass: 100%/32  | Total:  6h 20m | Avg: 11m 54s | Max: 44m 06s | Hits:   3%/4467  
      🟩 17                 Pass: 100%/30  | Total:  6h 34m | Avg: 13m 08s | Max: 41m 08s | Hits:   3%/2442  
      🟩 20                 Pass: 100%/23  | Total:  5h 59m | Avg: 15m 37s | Max: 49m 50s | Hits:   3%/2595  
    
  • 🟩 cub: Pass: 100%/110 | Total: 4d 00h | Avg: 52m 25s | Max: 1h 23m | Hits: 0%/2964

    🟩 cpu
      🟩 amd64              Pass: 100%/102 | Total:  3d 16h | Avg: 52m 04s | Max:  1h 23m | Hits:   0%/2964  
      🟩 arm64              Pass: 100%/8   | Total:  7h 35m | Avg: 56m 54s | Max:  1h 04m
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total: 12h 01m | Avg: 48m 07s | Max:  1h 02m | Hits:   0%/741   
      🟩 11.8               Pass: 100%/3   | Total:  3h 24m | Avg:  1h 08m | Max:  1h 08m
      🟩 12.5               Pass: 100%/4   | Total:  4h 45m | Avg:  1h 11m | Max:  1h 14m
      🟩 12.6               Pass: 100%/88  | Total:  3d 03h | Avg: 51m 46s | Max:  1h 23m | Hits:   0%/2223  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  3h 45m | Avg: 56m 15s | Max: 58m 07s
      🟩 nvcc11.1           Pass: 100%/15  | Total: 12h 01m | Avg: 48m 07s | Max:  1h 02m | Hits:   0%/741   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  3h 24m | Avg:  1h 08m | Max:  1h 08m
      🟩 nvcc12.5           Pass: 100%/4   | Total:  4h 45m | Avg:  1h 11m | Max:  1h 14m
      🟩 nvcc12.6           Pass: 100%/84  | Total:  3d 00h | Avg: 51m 33s | Max:  1h 23m | Hits:   0%/2223  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  3h 45m | Avg: 56m 15s | Max: 58m 07s
      🟩 nvcc               Pass: 100%/106 | Total:  3d 20h | Avg: 52m 17s | Max:  1h 23m | Hits:   0%/2964  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  4h 50m | Avg: 48m 26s | Max: 52m 18s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 43m | Avg: 54m 22s | Max: 58m 11s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 22m | Avg: 50m 40s | Max: 51m 08s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 35m | Avg: 53m 47s | Max: 56m 02s
      🟩 Clang13            Pass: 100%/4   | Total:  3h 21m | Avg: 50m 20s | Max: 50m 57s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 31m | Avg: 52m 59s | Max: 57m 25s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 34m | Avg: 53m 35s | Max: 57m 21s
      🟩 Clang16            Pass: 100%/4   | Total:  3h 24m | Avg: 51m 12s | Max: 53m 33s
      🟩 Clang17            Pass: 100%/4   | Total:  3h 51m | Avg: 57m 55s | Max:  1h 10m
      🟩 Clang18            Pass: 100%/11  | Total:  9h 08m | Avg: 49m 49s | Max: 58m 07s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 30m | Avg: 45m 12s | Max: 45m 38s
      🟩 GCC7               Pass: 100%/6   | Total:  4h 54m | Avg: 49m 04s | Max: 53m 53s
      🟩 GCC8               Pass: 100%/6   | Total:  5h 14m | Avg: 52m 29s | Max:  1h 06m
      🟩 GCC9               Pass: 100%/6   | Total:  5h 15m | Avg: 52m 37s | Max:  1h 02m
      🟩 GCC10              Pass: 100%/4   | Total:  3h 54m | Avg: 58m 36s | Max:  1h 02m
      🟩 GCC11              Pass: 100%/7   | Total:  7h 11m | Avg:  1h 01m | Max:  1h 08m
      🟩 GCC12              Pass: 100%/4   | Total:  3h 49m | Avg: 57m 20s | Max:  1h 05m
      🟩 GCC13              Pass: 100%/16  | Total: 10h 32m | Avg: 39m 31s | Max:  1h 23m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  3h 03m | Avg:  1h 01m | Max:  1h 02m
      🟩 MSVC14.16          Pass: 100%/1   | Total: 59m 10s | Avg: 59m 10s | Max: 59m 10s | Hits:   0%/741   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 13m | Hits:   0%/1482  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  1h 16m | Avg:  1h 16m | Max:  1h 16m | Hits:   0%/741   
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  4h 45m | Avg:  1h 11m | Max:  1h 14m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  1d 17h | Avg: 51m 44s | Max:  1h 10m
      🟩 GCC                Pass: 100%/51  | Total:  1d 18h | Avg: 49m 51s | Max:  1h 23m
      🟩 Intel              Pass: 100%/3   | Total:  3h 03m | Avg:  1h 01m | Max:  1h 02m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 31m | Avg:  1h 07m | Max:  1h 16m | Hits:   0%/2964  
      🟩 NVHPC              Pass: 100%/4   | Total:  4h 45m | Avg:  1h 11m | Max:  1h 14m
    🟩 gpu
      🟩 v100               Pass: 100%/110 | Total:  4d 00h | Avg: 52m 25s | Max:  1h 23m | Hits:   0%/2964  
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total:  3d 20h | Avg: 54m 10s | Max:  1h 16m | Hits:   0%/2964  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 17m 48s | Avg: 17m 48s | Max: 17m 48s
      🟩 GraphCapture       Pass: 100%/1   | Total:  1h 23m | Avg:  1h 23m | Max:  1h 23m
      🟩 HostLaunch         Pass: 100%/3   | Total: 54m 38s | Avg: 18m 12s | Max: 18m 41s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 24m | Avg: 28m 14s | Max: 39m 11s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  3h 24m | Avg:  1h 08m | Max:  1h 08m
      🟩 90a                Pass: 100%/4   | Total:  1h 34m | Avg: 23m 44s | Max: 25m 25s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  1d 01h | Avg: 51m 15s | Max:  1h 14m
      🟩 14                 Pass: 100%/29  | Total:  1d 01h | Avg: 53m 46s | Max:  1h 08m | Hits:   0%/1482  
      🟩 17                 Pass: 100%/27  | Total:  1d 00h | Avg: 53m 47s | Max:  1h 13m | Hits:   0%/741   
      🟩 20                 Pass: 100%/24  | Total: 20h 17m | Avg: 50m 43s | Max:  1h 23m | Hits:   0%/741   
    
  • 🟩 thrust: Pass: 100%/109 | Total: 2d 23h | Avg: 39m 05s | Max: 1h 25m | Hits: 20%/13180

    🟩 cpu
      🟩 amd64              Pass: 100%/101 | Total:  2d 18h | Avg: 39m 20s | Max:  1h 25m | Hits:  20%/13180 
      🟩 arm64              Pass: 100%/8   | Total:  4h 48m | Avg: 36m 01s | Max: 42m 25s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  9h 31m | Avg: 38m 06s | Max:  1h 17m | Hits:   0%/2636  
      🟩 11.8               Pass: 100%/3   | Total:  2h 30m | Avg: 50m 07s | Max: 55m 54s
      🟩 12.5               Pass: 100%/4   | Total:  5h 25m | Avg:  1h 21m | Max:  1h 24m
      🟩 12.6               Pass: 100%/87  | Total:  2d 05h | Avg: 36m 56s | Max:  1h 25m | Hits:  25%/10544 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  2h 01m | Avg: 30m 17s | Max: 35m 20s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  9h 31m | Avg: 38m 06s | Max:  1h 17m | Hits:   0%/2636  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 30m | Avg: 50m 07s | Max: 55m 54s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  5h 25m | Avg:  1h 21m | Max:  1h 24m
      🟩 nvcc12.6           Pass: 100%/83  | Total:  2d 03h | Avg: 37m 15s | Max:  1h 25m | Hits:  25%/10544 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  2h 01m | Avg: 30m 17s | Max: 35m 20s
      🟩 nvcc               Pass: 100%/105 | Total:  2d 21h | Avg: 39m 26s | Max:  1h 25m | Hits:  20%/13180 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 29m | Avg: 34m 57s | Max: 44m 04s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 56m | Avg: 38m 56s | Max: 41m 13s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 20m | Avg: 35m 07s | Max: 37m 49s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 39m | Avg: 39m 51s | Max: 50m 28s
      🟩 Clang13            Pass: 100%/4   | Total:  2h 20m | Avg: 35m 06s | Max: 37m 32s
      🟩 Clang14            Pass: 100%/4   | Total:  2h 36m | Avg: 39m 02s | Max: 43m 57s
      🟩 Clang15            Pass: 100%/4   | Total:  2h 37m | Avg: 39m 26s | Max: 49m 22s
      🟩 Clang16            Pass: 100%/4   | Total:  2h 25m | Avg: 36m 21s | Max: 39m 40s
      🟩 Clang17            Pass: 100%/4   | Total:  2h 36m | Avg: 39m 09s | Max: 49m 14s
      🟩 Clang18            Pass: 100%/11  | Total:  5h 13m | Avg: 28m 31s | Max: 37m 34s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 06m | Avg: 33m 20s | Max: 36m 23s
      🟩 GCC7               Pass: 100%/6   | Total:  3h 57m | Avg: 39m 38s | Max: 46m 34s
      🟩 GCC8               Pass: 100%/6   | Total:  3h 48m | Avg: 38m 07s | Max: 39m 50s
      🟩 GCC9               Pass: 100%/6   | Total:  3h 35m | Avg: 35m 55s | Max: 40m 01s
      🟩 GCC10              Pass: 100%/4   | Total:  2h 31m | Avg: 37m 50s | Max: 40m 32s
      🟩 GCC11              Pass: 100%/7   | Total:  5h 10m | Avg: 44m 25s | Max: 55m 54s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 34m | Avg: 38m 35s | Max: 43m 02s
      🟩 GCC13              Pass: 100%/14  | Total:  6h 08m | Avg: 26m 19s | Max: 43m 10s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 54m | Avg: 58m 08s | Max:  1h 02m
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 17m | Avg:  1h 17m | Max:  1h 17m | Hits:   0%/2636  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 21m | Avg:  1h 10m | Max:  1h 12m | Hits:   0%/5272  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 51m | Avg: 55m 56s | Max:  1h 25m | Hits:  50%/5272  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  5h 25m | Avg:  1h 21m | Max:  1h 24m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  1d 04h | Avg: 35m 20s | Max: 50m 28s
      🟩 GCC                Pass: 100%/49  | Total:  1d 04h | Avg: 35m 23s | Max: 55m 54s
      🟩 Intel              Pass: 100%/3   | Total:  2h 54m | Avg: 58m 08s | Max:  1h 02m
      🟩 MSVC               Pass: 100%/5   | Total:  5h 30m | Avg:  1h 06m | Max:  1h 25m | Hits:  20%/13180 
      🟩 NVHPC              Pass: 100%/4   | Total:  5h 25m | Avg:  1h 21m | Max:  1h 24m
    🟩 gpu
      🟩 v100               Pass: 100%/109 | Total:  2d 23h | Avg: 39m 05s | Max:  1h 25m | Hits:  20%/13180 
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total:  2d 21h | Avg: 40m 51s | Max:  1h 25m | Hits:   0%/10544 
      🟩 TestCPU            Pass: 100%/4   | Total: 51m 16s | Avg: 12m 49s | Max: 26m 43s | Hits:  99%/2636  
      🟩 TestGPU            Pass: 100%/3   | Total: 42m 18s | Avg: 14m 06s | Max: 17m 27s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 30m | Avg: 50m 07s | Max: 55m 54s
      🟩 90a                Pass: 100%/4   | Total:  1h 33m | Avg: 23m 18s | Max: 27m 02s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total: 16h 40m | Avg: 33m 20s | Max:  1h 13m
      🟩 14                 Pass: 100%/29  | Total: 20h 33m | Avg: 42m 31s | Max:  1h 23m | Hits:   0%/5272  
      🟩 17                 Pass: 100%/27  | Total: 19h 34m | Avg: 43m 30s | Max:  1h 24m | Hits:   0%/2636  
      🟩 20                 Pass: 100%/23  | Total: 14h 13m | Avg: 37m 06s | Max:  1h 25m | Hits:  50%/5272  
    
  • 🟩 cudax: Pass: 100%/54 | Total: 5h 35m | Avg: 6m 12s | Max: 18m 48s | Hits: 2%/246

    🟩 cpu
      🟩 amd64              Pass: 100%/50  | Total:  5h 16m | Avg:  6m 20s | Max: 18m 48s | Hits:   2%/246   
      🟩 arm64              Pass: 100%/4   | Total: 18m 20s | Avg:  4m 35s | Max:  4m 56s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  2h 11m | Avg:  6m 55s | Max: 18m 32s | Hits:   2%/123   
      🟩 12.5               Pass: 100%/2   | Total: 23m 18s | Avg: 11m 39s | Max: 12m 09s
      🟩 12.6               Pass: 100%/33  | Total:  3h 00m | Avg:  5m 27s | Max: 18m 48s | Hits:   2%/123   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  2h 11m | Avg:  6m 55s | Max: 18m 32s | Hits:   2%/123   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 23m 18s | Avg: 11m 39s | Max: 12m 09s
      🟩 nvcc12.6           Pass: 100%/33  | Total:  3h 00m | Avg:  5m 27s | Max: 18m 48s | Hits:   2%/123   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/54  | Total:  5h 35m | Avg:  6m 12s | Max: 18m 48s | Hits:   2%/246   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  5m 55s
      🟩 Clang10            Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  6m 01s
      🟩 Clang11            Pass: 100%/4   | Total: 18m 45s | Avg:  4m 41s | Max:  5m 44s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 20s | Avg:  4m 35s | Max:  5m 14s
      🟩 Clang13            Pass: 100%/4   | Total: 17m 57s | Avg:  4m 29s | Max:  5m 32s
      🟩 Clang14            Pass: 100%/4   | Total: 29m 36s | Avg:  7m 24s | Max: 18m 32s
      🟩 Clang15            Pass: 100%/2   | Total:  7m 59s | Avg:  3m 59s | Max:  4m 09s
      🟩 Clang16            Pass: 100%/4   | Total: 17m 15s | Avg:  4m 18s | Max:  4m 37s
      🟩 Clang17            Pass: 100%/2   | Total:  7m 34s | Avg:  3m 47s | Max:  3m 51s
      🟩 Clang18            Pass: 100%/2   | Total: 22m 55s | Avg: 11m 27s | Max: 18m 48s
      🟩 GCC9               Pass: 100%/2   | Total:  9m 29s | Avg:  4m 44s | Max:  5m 57s
      🟩 GCC10              Pass: 100%/4   | Total: 19m 10s | Avg:  4m 47s | Max:  6m 07s
      🟩 GCC11              Pass: 100%/4   | Total: 18m 52s | Avg:  4m 43s | Max:  5m 49s
      🟩 GCC12              Pass: 100%/7   | Total:  1h 06m | Avg:  9m 33s | Max: 18m 32s
      🟩 GCC13              Pass: 100%/3   | Total: 13m 00s | Avg:  4m 20s | Max:  4m 56s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 12m 04s | Avg: 12m 04s | Max: 12m 04s | Hits:   2%/123   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 47s | Avg: 11m 47s | Max: 11m 47s | Hits:   2%/123   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 23m 18s | Avg: 11m 39s | Max: 12m 09s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  2h 40m | Avg:  5m 21s | Max: 18m 48s
      🟩 GCC                Pass: 100%/20  | Total:  2h 07m | Avg:  6m 22s | Max: 18m 32s
      🟩 MSVC               Pass: 100%/2   | Total: 23m 51s | Avg: 11m 55s | Max: 12m 04s | Hits:   2%/246   
      🟩 NVHPC              Pass: 100%/2   | Total: 23m 18s | Avg: 11m 39s | Max: 12m 09s
    🟩 gpu
      🟩 v100               Pass: 100%/54  | Total:  5h 35m | Avg:  6m 12s | Max: 18m 48s | Hits:   2%/246   
    🟩 jobs
      🟩 Build              Pass: 100%/49  | Total:  4h 08m | Avg:  5m 03s | Max: 12m 09s | Hits:   2%/246   
      🟩 Test               Pass: 100%/5   | Total:  1h 26m | Avg: 17m 23s | Max: 18m 48s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  4m 45s | Avg:  4m 45s | Max:  4m 45s
      🟩 90a                Pass: 100%/1   | Total:  3m 20s | Avg:  3m 20s | Max:  3m 20s
    🟩 std
      🟩 17                 Pass: 100%/29  | Total:  2h 41m | Avg:  5m 34s | Max: 16m 00s
      🟩 20                 Pass: 100%/25  | Total:  2h 53m | Avg:  6m 56s | Max: 18m 48s | Hits:   2%/246   
    
  • 🟩 cccl: Pass: 100%/6 | Total: 27m 50s | Avg: 4m 38s | Max: 5m 14s

    🟩 cpu
      🟩 amd64              Pass: 100%/6   | Total: 27m 50s | Avg:  4m 38s | Max:  5m 14s
    🟩 ctk
      🟩 11.1               Pass: 100%/2   | Total:  7m 52s | Avg:  3m 56s | Max:  4m 13s
      🟩 12.0               Pass: 100%/2   | Total:  9m 48s | Avg:  4m 54s | Max:  5m 03s
      🟩 12.6               Pass: 100%/2   | Total: 10m 10s | Avg:  5m 05s | Max:  5m 14s
    🟩 cudacxx
      🟩 nvcc11.1           Pass: 100%/2   | Total:  7m 52s | Avg:  3m 56s | Max:  4m 13s
      🟩 nvcc12.0           Pass: 100%/2   | Total:  9m 48s | Avg:  4m 54s | Max:  5m 03s
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 10s | Avg:  5m 05s | Max:  5m 14s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/6   | Total: 27m 50s | Avg:  4m 38s | Max:  5m 14s
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  4m 13s | Avg:  4m 13s | Max:  4m 13s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 03s | Avg:  5m 03s | Max:  5m 03s
      🟩 Clang18            Pass: 100%/1   | Total:  5m 14s | Avg:  5m 14s | Max:  5m 14s
      🟩 GCC6               Pass: 100%/1   | Total:  3m 39s | Avg:  3m 39s | Max:  3m 39s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 45s | Avg:  4m 45s | Max:  4m 45s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 56s | Avg:  4m 56s | Max:  4m 56s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/3   | Total: 14m 30s | Avg:  4m 50s | Max:  5m 14s
      🟩 GCC                Pass: 100%/3   | Total: 13m 20s | Avg:  4m 26s | Max:  4m 56s
    🟩 gpu
      🟩 v100               Pass: 100%/6   | Total: 27m 50s | Avg:  4m 38s | Max:  5m 14s
    🟩 jobs
      🟩 Infra              Pass: 100%/6   | Total: 27m 50s | Avg:  4m 38s | Max:  5m 14s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 46s | Avg: 4m 53s | Max: 7m 34s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 46s | Avg:  4m 53s | Max:  7m 34s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 46s | Avg:  4m 53s | Max:  7m 34s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 46s | Avg:  4m 53s | Max:  7m 34s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 46s | Avg:  4m 53s | Max:  7m 34s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 46s | Avg:  4m 53s | Max:  7m 34s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 46s | Avg:  4m 53s | Max:  7m 34s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 46s | Avg:  4m 53s | Max:  7m 34s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 12s | Avg:  2m 12s | Max:  2m 12s
      🟩 Test               Pass: 100%/1   | Total:  7m 34s | Avg:  7m 34s | Max:  7m 34s
    
  • 🟩 python: Pass: 100%/1 | Total: 15m 29s | Avg: 15m 29s | Max: 15m 29s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 29s | Avg: 15m 29s | Max: 15m 29s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 15m 29s | Avg: 15m 29s | Max: 15m 29s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 15m 29s | Avg: 15m 29s | Max: 15m 29s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 29s | Avg: 15m 29s | Max: 15m 29s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 29s | Avg: 15m 29s | Max: 15m 29s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 29s | Avg: 15m 29s | Max: 15m 29s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 29s | Avg: 15m 29s | Max: 15m 29s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 29s | Avg: 15m 29s | Max: 15m 29s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 400)

# Runner
326 linux-amd64-cpu16
31 linux-amd64-gpu-v100-latest-1
28 linux-arm64-cpu16
15 windows-amd64-cpu16

Comment on lines +110 to +115
def print_speedup(medians):
m = medians.groupby(['alg', 'hue'], sort=False).mean()
m['speedup'] = (m['bw'] / m.groupby(['alg'])['bw'].transform('first'))
print('# Speedups:')
print()
print(m.drop(columns='bw').sort_values(by='speedup', ascending=False).to_markdown())
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This feels a bit misplaced here, but it's great to have the speedup's from the plot as a copyable table as well., since compare.py works on a much more detailed level.

* Mark compare as executable
* Plot and compare each sub bench
* Avoid confusing use of 'bw'
* Don't filter the runs
Copy link

copy-pr-bot bot commented Dec 3, 2024

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@bernhardmgruber bernhardmgruber marked this pull request as ready for review December 3, 2024 14:58
@bernhardmgruber bernhardmgruber requested a review from a team as a code owner December 3, 2024 14:58
@bernhardmgruber
Copy link
Contributor Author

The scripts are definitely a bit hacky, but I need those changes regularly, so let's proceed and get them merged.

Copy link
Contributor

github-actions bot commented Dec 3, 2024

🟩 CI finished in 5h 15m: Pass: 100%/402 | Total: 6d 20h | Avg: 24m 30s | Max: 1h 24m | Hits: 68%/22121
  • 🟩 libcudacxx: Pass: 100%/118 | Total: 1d 05h | Avg: 15m 00s | Max: 50m 00s | Hits: 55%/9577

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total:  1d 03h | Avg: 15m 16s | Max: 50m 00s | Hits:  55%/9577  
      🟩 arm64              Pass: 100%/8   | Total:  1h 32m | Avg: 11m 32s | Max: 13m 42s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  3h 41m | Avg: 14m 45s | Max: 34m 11s | Hits:  38%/2210  
      🟩 11.8               Pass: 100%/3   | Total: 51m 56s | Avg: 17m 18s | Max: 20m 15s
      🟩 12.5               Pass: 100%/4   | Total:  1h 32m | Avg: 23m 08s | Max: 30m 00s
      🟩 12.6               Pass: 100%/96  | Total: 23h 26m | Avg: 14m 38s | Max: 50m 00s | Hits:  61%/7367  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/12  | Total:  2h 34m | Avg: 12m 50s | Max: 20m 05s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  3h 41m | Avg: 14m 45s | Max: 34m 11s | Hits:  38%/2210  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 51m 56s | Avg: 17m 18s | Max: 20m 15s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  1h 32m | Avg: 23m 08s | Max: 30m 00s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 20h 51m | Avg: 14m 54s | Max: 50m 00s | Hits:  61%/7367  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/12  | Total:  2h 34m | Avg: 12m 50s | Max: 20m 05s
      🟩 nvcc               Pass: 100%/106 | Total:  1d 02h | Avg: 15m 15s | Max: 50m 00s | Hits:  55%/9577  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  1h 16m | Avg: 12m 44s | Max: 20m 09s
      🟩 Clang10            Pass: 100%/3   | Total: 39m 16s | Avg: 13m 05s | Max: 13m 37s
      🟩 Clang11            Pass: 100%/4   | Total: 59m 47s | Avg: 14m 56s | Max: 20m 59s
      🟩 Clang12            Pass: 100%/4   | Total:  1h 06m | Avg: 16m 42s | Max: 21m 02s
      🟩 Clang13            Pass: 100%/4   | Total: 59m 03s | Avg: 14m 45s | Max: 17m 42s
      🟩 Clang14            Pass: 100%/4   | Total: 59m 36s | Avg: 14m 54s | Max: 20m 21s
      🟩 Clang15            Pass: 100%/4   | Total: 51m 09s | Avg: 12m 47s | Max: 14m 38s
      🟩 Clang16            Pass: 100%/4   | Total: 57m 09s | Avg: 14m 17s | Max: 22m 12s
      🟩 Clang17            Pass: 100%/4   | Total: 49m 47s | Avg: 12m 26s | Max: 12m 58s
      🟩 Clang18            Pass: 100%/18  | Total:  4h 12m | Avg: 14m 02s | Max: 37m 39s
      🟩 GCC6               Pass: 100%/2   | Total: 28m 51s | Avg: 14m 25s | Max: 20m 57s
      🟩 GCC7               Pass: 100%/6   | Total:  1h 21m | Avg: 13m 39s | Max: 21m 01s
      🟩 GCC8               Pass: 100%/6   | Total:  1h 19m | Avg: 13m 12s | Max: 20m 11s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 17m | Avg: 12m 58s | Max: 21m 00s
      🟩 GCC10              Pass: 100%/4   | Total: 48m 34s | Avg: 12m 08s | Max: 13m 14s
      🟩 GCC11              Pass: 100%/7   | Total:  1h 46m | Avg: 15m 10s | Max: 20m 15s
      🟩 GCC12              Pass: 100%/4   | Total: 55m 16s | Avg: 13m 49s | Max: 16m 00s
      🟩 GCC13              Pass: 100%/17  | Total:  4h 27m | Avg: 15m 44s | Max: 50m 00s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 39m 35s | Avg: 13m 11s | Max: 13m 57s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 34m 11s | Avg: 34m 11s | Max: 34m 11s | Hits:  38%/2210  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 00m | Avg: 30m 05s | Max: 32m 56s | Hits:  55%/4761  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 28m 12s | Avg: 28m 12s | Max: 28m 12s | Hits:  70%/2606  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  1h 32m | Avg: 23m 08s | Max: 30m 00s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/55  | Total: 12h 51m | Avg: 14m 01s | Max: 37m 39s
      🟩 GCC                Pass: 100%/52  | Total: 12h 25m | Avg: 14m 20s | Max: 50m 00s
      🟩 Intel              Pass: 100%/3   | Total: 39m 35s | Avg: 13m 11s | Max: 13m 57s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 02m | Avg: 30m 38s | Max: 34m 11s | Hits:  55%/9577  
      🟩 NVHPC              Pass: 100%/4   | Total:  1h 32m | Avg: 23m 08s | Max: 30m 00s
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total:  1d 05h | Avg: 15m 00s | Max: 50m 00s | Hits:  55%/9577  
    🟩 jobs
      🟩 Build              Pass: 100%/110 | Total:  1d 02h | Avg: 14m 15s | Max: 34m 11s | Hits:  55%/9577  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 47m | Avg: 26m 49s | Max: 50m 00s
      🟩 Test               Pass: 100%/3   | Total:  1h 33m | Avg: 31m 19s | Max: 40m 35s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 20s | Avg:  2m 20s | Max:  2m 20s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 51m 56s | Avg: 17m 18s | Max: 20m 15s
      🟩 90                 Pass: 100%/4   | Total: 43m 54s | Avg: 10m 58s | Max: 13m 05s
      🟩 90a                Pass: 100%/8   | Total:  1h 15m | Avg:  9m 29s | Max: 12m 26s
    🟩 std
      🟩 11                 Pass: 100%/32  | Total:  7h 28m | Avg: 14m 00s | Max: 21m 01s
      🟩 14                 Pass: 100%/32  | Total:  7h 08m | Avg: 13m 24s | Max: 34m 11s | Hits:  37%/4512  
      🟩 17                 Pass: 100%/30  | Total:  7h 43m | Avg: 15m 26s | Max: 30m 00s | Hits:  73%/2459  
      🟩 20                 Pass: 100%/23  | Total:  7h 09m | Avg: 18m 39s | Max: 50m 00s | Hits:  70%/2606  
    
  • 🟩 thrust: Pass: 100%/111 | Total: 1d 07h | Avg: 16m 54s | Max: 56m 09s | Hits: 81%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 27m 21s | Avg: 13m 40s | Max: 16m 00s
    🟩 cpu
      🟩 amd64              Pass: 100%/103 | Total:  1d 05h | Avg: 17m 18s | Max: 56m 09s | Hits:  81%/9260  
      🟩 arm64              Pass: 100%/8   | Total:  1h 35m | Avg: 11m 54s | Max: 16m 24s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  4h 27m | Avg: 17m 51s | Max: 29m 59s | Hits:  91%/1852  
      🟩 11.8               Pass: 100%/3   | Total:  1h 25m | Avg: 28m 22s | Max: 34m 30s
      🟩 12.5               Pass: 100%/4   | Total:  2h 56m | Avg: 44m 03s | Max: 52m 10s
      🟩 12.6               Pass: 100%/89  | Total: 22h 28m | Avg: 15m 08s | Max: 56m 09s | Hits:  78%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 32m | Avg: 23m 11s | Max: 25m 50s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  4h 27m | Avg: 17m 51s | Max: 29m 59s | Hits:  91%/1852  
      🟩 nvcc11.8           Pass: 100%/3   | Total:  1h 25m | Avg: 28m 22s | Max: 34m 30s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  2h 56m | Avg: 44m 03s | Max: 52m 10s
      🟩 nvcc12.6           Pass: 100%/85  | Total: 20h 55m | Avg: 14m 46s | Max: 56m 09s | Hits:  78%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 32m | Avg: 23m 11s | Max: 25m 50s
      🟩 nvcc               Pass: 100%/107 | Total:  1d 05h | Avg: 16m 40s | Max: 56m 09s | Hits:  81%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  1h 26m | Avg: 14m 28s | Max: 17m 10s
      🟩 Clang10            Pass: 100%/3   | Total: 41m 24s | Avg: 13m 48s | Max: 20m 12s
      🟩 Clang11            Pass: 100%/4   | Total: 53m 35s | Avg: 13m 23s | Max: 15m 18s
      🟩 Clang12            Pass: 100%/4   | Total: 52m 26s | Avg: 13m 06s | Max: 18m 05s
      🟩 Clang13            Pass: 100%/4   | Total: 50m 48s | Avg: 12m 42s | Max: 16m 24s
      🟩 Clang14            Pass: 100%/4   | Total: 56m 15s | Avg: 14m 03s | Max: 17m 46s
      🟩 Clang15            Pass: 100%/4   | Total: 57m 44s | Avg: 14m 26s | Max: 18m 42s
      🟩 Clang16            Pass: 100%/4   | Total:  1h 00m | Avg: 15m 07s | Max: 20m 31s
      🟩 Clang17            Pass: 100%/4   | Total: 55m 03s | Avg: 13m 45s | Max: 17m 45s
      🟩 Clang18            Pass: 100%/11  | Total:  2h 54m | Avg: 15m 52s | Max: 25m 50s
      🟩 GCC6               Pass: 100%/2   | Total: 30m 24s | Avg: 15m 12s | Max: 17m 03s
      🟩 GCC7               Pass: 100%/6   | Total:  1h 24m | Avg: 14m 01s | Max: 17m 57s
      🟩 GCC8               Pass: 100%/6   | Total:  1h 38m | Avg: 16m 29s | Max: 22m 59s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 31m | Avg: 15m 16s | Max: 22m 40s
      🟩 GCC10              Pass: 100%/4   | Total: 57m 22s | Avg: 14m 20s | Max: 19m 00s
      🟩 GCC11              Pass: 100%/7   | Total:  2h 16m | Avg: 19m 31s | Max: 34m 30s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 05m | Avg: 16m 22s | Max: 20m 39s
      🟩 GCC13              Pass: 100%/16  | Total:  2h 51m | Avg: 10m 43s | Max: 18m 26s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 27m | Avg: 29m 19s | Max: 33m 36s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 29m 59s | Avg: 29m 59s | Max: 29m 59s | Hits:  91%/1852  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 25m | Avg: 42m 54s | Max: 56m 09s | Hits:  75%/3704  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 12m | Avg: 36m 01s | Max: 48m 01s | Hits:  82%/3704  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  2h 56m | Avg: 44m 03s | Max: 52m 10s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total: 11h 29m | Avg: 14m 21s | Max: 25m 50s
      🟩 GCC                Pass: 100%/51  | Total: 12h 16m | Avg: 14m 26s | Max: 34m 30s
      🟩 Intel              Pass: 100%/3   | Total:  1h 27m | Avg: 29m 19s | Max: 33m 36s
      🟩 MSVC               Pass: 100%/5   | Total:  3h 07m | Avg: 37m 34s | Max: 56m 09s | Hits:  81%/9260  
      🟩 NVHPC              Pass: 100%/4   | Total:  2h 56m | Avg: 44m 03s | Max: 52m 10s
    🟩 gpu
      🟩 v100               Pass: 100%/111 | Total:  1d 07h | Avg: 16m 54s | Max: 56m 09s | Hits:  81%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total:  1d 05h | Avg: 17m 06s | Max: 56m 09s | Hits:  76%/7408  
      🟩 TestCPU            Pass: 100%/4   | Total: 47m 07s | Avg: 11m 46s | Max: 24m 02s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/4   | Total:  1h 07m | Avg: 16m 58s | Max: 19m 50s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  1h 25m | Avg: 28m 22s | Max: 34m 30s
      🟩 90a                Pass: 100%/4   | Total: 19m 28s | Avg:  4m 52s | Max:  5m 15s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  5h 24m | Avg: 10m 48s | Max: 33m 19s
      🟩 14                 Pass: 100%/29  | Total:  9h 08m | Avg: 18m 55s | Max: 56m 09s | Hits:  75%/3704  
      🟩 17                 Pass: 100%/27  | Total:  8h 53m | Avg: 19m 46s | Max: 52m 10s | Hits:  90%/1852  
      🟩 20                 Pass: 100%/23  | Total:  7h 23m | Avg: 19m 16s | Max: 49m 52s | Hits:  82%/3704  
    
  • 🟩 cub: Pass: 100%/110 | Total: 3d 18h | Avg: 49m 08s | Max: 1h 24m | Hits: 69%/3028

    🟩 cpu
      🟩 amd64              Pass: 100%/102 | Total:  3d 11h | Avg: 49m 00s | Max:  1h 24m | Hits:  69%/3028  
      🟩 arm64              Pass: 100%/8   | Total:  6h 45m | Avg: 50m 39s | Max: 58m 43s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  9h 17m | Avg: 37m 08s | Max: 53m 36s | Hits:  69%/757   
      🟩 11.8               Pass: 100%/3   | Total:  2h 56m | Avg: 58m 47s | Max:  1h 03m
      🟩 12.5               Pass: 100%/4   | Total:  3h 32m | Avg: 53m 08s | Max: 55m 57s
      🟩 12.6               Pass: 100%/88  | Total:  3d 02h | Avg: 50m 40s | Max:  1h 24m | Hits:  69%/2271  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  3h 34m | Avg: 53m 43s | Max: 54m 51s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  9h 17m | Avg: 37m 08s | Max: 53m 36s | Hits:  69%/757   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 56m | Avg: 58m 47s | Max:  1h 03m
      🟩 nvcc12.5           Pass: 100%/4   | Total:  3h 32m | Avg: 53m 08s | Max: 55m 57s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  2d 22h | Avg: 50m 31s | Max:  1h 24m | Hits:  69%/2271  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  3h 34m | Avg: 53m 43s | Max: 54m 51s
      🟩 nvcc               Pass: 100%/106 | Total:  3d 14h | Avg: 48m 57s | Max:  1h 24m | Hits:  69%/3028  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  4h 26m | Avg: 44m 27s | Max: 54m 31s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 39m | Avg: 53m 10s | Max: 56m 49s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 31m | Avg: 52m 52s | Max: 54m 13s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 26m | Avg: 51m 37s | Max: 53m 38s
      🟩 Clang13            Pass: 100%/4   | Total:  3h 28m | Avg: 52m 11s | Max: 53m 54s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 30m | Avg: 52m 38s | Max: 55m 09s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 24m | Avg: 51m 14s | Max: 53m 43s
      🟩 Clang16            Pass: 100%/4   | Total:  3h 29m | Avg: 52m 19s | Max: 53m 36s
      🟩 Clang17            Pass: 100%/4   | Total:  3h 37m | Avg: 54m 27s | Max: 58m 28s
      🟩 Clang18            Pass: 100%/11  | Total:  9h 58m | Avg: 54m 23s | Max:  1h 24m
      🟩 GCC6               Pass: 100%/2   | Total:  1h 09m | Avg: 34m 46s | Max: 35m 14s
      🟩 GCC7               Pass: 100%/6   | Total:  4h 22m | Avg: 43m 42s | Max: 51m 38s
      🟩 GCC8               Pass: 100%/6   | Total:  4h 26m | Avg: 44m 27s | Max: 54m 36s
      🟩 GCC9               Pass: 100%/6   | Total:  4h 22m | Avg: 43m 47s | Max: 53m 30s
      🟩 GCC10              Pass: 100%/4   | Total:  3h 15m | Avg: 48m 53s | Max: 50m 48s
      🟩 GCC11              Pass: 100%/7   | Total:  6h 18m | Avg: 54m 00s | Max:  1h 03m
      🟩 GCC12              Pass: 100%/4   | Total:  3h 26m | Avg: 51m 34s | Max: 57m 25s
      🟩 GCC13              Pass: 100%/16  | Total: 11h 16m | Avg: 42m 15s | Max:  1h 21m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 23m | Avg: 47m 50s | Max: 48m 30s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 53m 36s | Avg: 53m 36s | Max: 53m 36s | Hits:  69%/757   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 03m | Hits:  70%/1514  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m | Hits:  65%/757   
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  3h 32m | Avg: 53m 08s | Max: 55m 57s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  1d 17h | Avg: 51m 57s | Max:  1h 24m
      🟩 GCC                Pass: 100%/51  | Total:  1d 14h | Avg: 45m 26s | Max:  1h 21m
      🟩 Intel              Pass: 100%/3   | Total:  2h 23m | Avg: 47m 50s | Max: 48m 30s
      🟩 MSVC               Pass: 100%/4   | Total:  3h 57m | Avg: 59m 20s | Max:  1h 03m | Hits:  69%/3028  
      🟩 NVHPC              Pass: 100%/4   | Total:  3h 32m | Avg: 53m 08s | Max: 55m 57s
    🟩 gpu
      🟩 v100               Pass: 100%/110 | Total:  3d 18h | Avg: 49m 08s | Max:  1h 24m | Hits:  69%/3028  
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total:  3d 10h | Avg: 48m 34s | Max:  1h 03m | Hits:  69%/3028  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 17m 34s | Avg: 17m 34s | Max: 17m 34s
      🟩 GraphCapture       Pass: 100%/1   | Total: 18m 12s | Avg: 18m 12s | Max: 18m 12s
      🟩 HostLaunch         Pass: 100%/3   | Total:  3h 46m | Avg:  1h 15m | Max:  1h 20m
      🟩 TestGPU            Pass: 100%/3   | Total:  3h 07m | Avg:  1h 02m | Max:  1h 24m
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 56m | Avg: 58m 47s | Max:  1h 03m
      🟩 90a                Pass: 100%/4   | Total:  1h 14m | Avg: 18m 39s | Max: 19m 45s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  1d 00h | Avg: 48m 43s | Max:  1h 21m
      🟩 14                 Pass: 100%/29  | Total: 23h 24m | Avg: 48m 25s | Max:  1h 03m | Hits:  71%/1514  
      🟩 17                 Pass: 100%/27  | Total: 22h 04m | Avg: 49m 04s | Max: 58m 50s | Hits:  66%/757   
      🟩 20                 Pass: 100%/24  | Total: 20h 13m | Avg: 50m 34s | Max:  1h 24m | Hits:  65%/757   
    
  • 🟩 cudax: Pass: 100%/54 | Total: 12h 22m | Avg: 13m 45s | Max: 19m 25s | Hits: 45%/256

    🟩 cpu
      🟩 amd64              Pass: 100%/50  | Total: 11h 28m | Avg: 13m 46s | Max: 19m 25s | Hits:  45%/256   
      🟩 arm64              Pass: 100%/4   | Total: 53m 50s | Avg: 13m 27s | Max: 14m 55s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  4h 16m | Avg: 13m 29s | Max: 17m 59s | Hits:  45%/128   
      🟩 12.5               Pass: 100%/2   | Total: 15m 36s | Avg:  7m 48s | Max:  8m 17s
      🟩 12.6               Pass: 100%/33  | Total:  7h 50m | Avg: 14m 15s | Max: 19m 25s | Hits:  46%/128   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  4h 16m | Avg: 13m 29s | Max: 17m 59s | Hits:  45%/128   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 15m 36s | Avg:  7m 48s | Max:  8m 17s
      🟩 nvcc12.6           Pass: 100%/33  | Total:  7h 50m | Avg: 14m 15s | Max: 19m 25s | Hits:  46%/128   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/54  | Total: 12h 22m | Avg: 13m 45s | Max: 19m 25s | Hits:  45%/256   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total: 25m 50s | Avg: 12m 55s | Max: 13m 22s
      🟩 Clang10            Pass: 100%/2   | Total: 27m 42s | Avg: 13m 51s | Max: 14m 00s
      🟩 Clang11            Pass: 100%/4   | Total: 52m 53s | Avg: 13m 13s | Max: 14m 24s
      🟩 Clang12            Pass: 100%/4   | Total: 54m 06s | Avg: 13m 31s | Max: 14m 12s
      🟩 Clang13            Pass: 100%/4   | Total: 54m 11s | Avg: 13m 32s | Max: 14m 31s
      🟩 Clang14            Pass: 100%/4   | Total: 55m 31s | Avg: 13m 52s | Max: 15m 52s
      🟩 Clang15            Pass: 100%/2   | Total: 29m 09s | Avg: 14m 34s | Max: 14m 40s
      🟩 Clang16            Pass: 100%/4   | Total: 54m 59s | Avg: 13m 44s | Max: 14m 48s
      🟩 Clang17            Pass: 100%/2   | Total: 28m 52s | Avg: 14m 26s | Max: 14m 59s
      🟩 Clang18            Pass: 100%/2   | Total: 33m 19s | Avg: 16m 39s | Max: 17m 41s
      🟩 GCC9               Pass: 100%/2   | Total: 26m 38s | Avg: 13m 19s | Max: 13m 33s
      🟩 GCC10              Pass: 100%/4   | Total: 55m 32s | Avg: 13m 53s | Max: 14m 39s
      🟩 GCC11              Pass: 100%/4   | Total: 56m 43s | Avg: 14m 10s | Max: 15m 23s
      🟩 GCC12              Pass: 100%/7   | Total:  1h 48m | Avg: 15m 29s | Max: 19m 25s
      🟩 GCC13              Pass: 100%/3   | Total: 38m 13s | Avg: 12m 44s | Max: 14m 55s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 13m 13s | Avg: 13m 13s | Max: 13m 13s | Hits:  45%/128   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 51s | Avg: 11m 51s | Max: 11m 51s | Hits:  46%/128   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 15m 36s | Avg:  7m 48s | Max:  8m 17s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  6h 56m | Avg: 13m 53s | Max: 17m 41s
      🟩 GCC                Pass: 100%/20  | Total:  4h 45m | Avg: 14m 16s | Max: 19m 25s
      🟩 MSVC               Pass: 100%/2   | Total: 25m 04s | Avg: 12m 32s | Max: 13m 13s | Hits:  45%/256   
      🟩 NVHPC              Pass: 100%/2   | Total: 15m 36s | Avg:  7m 48s | Max:  8m 17s
    🟩 gpu
      🟩 v100               Pass: 100%/54  | Total: 12h 22m | Avg: 13m 45s | Max: 19m 25s | Hits:  45%/256   
    🟩 jobs
      🟩 Build              Pass: 100%/49  | Total: 10h 53m | Avg: 13m 20s | Max: 15m 38s | Hits:  45%/256   
      🟩 Test               Pass: 100%/5   | Total:  1h 28m | Avg: 17m 44s | Max: 19m 25s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 10m 44s | Avg: 10m 44s | Max: 10m 44s
      🟩 90a                Pass: 100%/1   | Total: 10m 08s | Avg: 10m 08s | Max: 10m 08s
    🟩 std
      🟩 17                 Pass: 100%/29  | Total:  6h 31m | Avg: 13m 29s | Max: 19m 25s
      🟩 20                 Pass: 100%/25  | Total:  5h 51m | Avg: 14m 03s | Max: 17m 45s | Hits:  45%/256   
    
  • 🟩 cccl: Pass: 100%/6 | Total: 30m 26s | Avg: 5m 04s | Max: 6m 16s

    🟩 cpu
      🟩 amd64              Pass: 100%/6   | Total: 30m 26s | Avg:  5m 04s | Max:  6m 16s
    🟩 ctk
      🟩 11.1               Pass: 100%/2   | Total:  8m 16s | Avg:  4m 08s | Max:  4m 32s
      🟩 12.0               Pass: 100%/2   | Total: 10m 23s | Avg:  5m 11s | Max:  5m 32s
      🟩 12.6               Pass: 100%/2   | Total: 11m 47s | Avg:  5m 53s | Max:  6m 16s
    🟩 cudacxx
      🟩 nvcc11.1           Pass: 100%/2   | Total:  8m 16s | Avg:  4m 08s | Max:  4m 32s
      🟩 nvcc12.0           Pass: 100%/2   | Total: 10m 23s | Avg:  5m 11s | Max:  5m 32s
      🟩 nvcc12.6           Pass: 100%/2   | Total: 11m 47s | Avg:  5m 53s | Max:  6m 16s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/6   | Total: 30m 26s | Avg:  5m 04s | Max:  6m 16s
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  4m 32s | Avg:  4m 32s | Max:  4m 32s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 51s | Avg:  4m 51s | Max:  4m 51s
      🟩 Clang18            Pass: 100%/1   | Total:  5m 31s | Avg:  5m 31s | Max:  5m 31s
      🟩 GCC6               Pass: 100%/1   | Total:  3m 44s | Avg:  3m 44s | Max:  3m 44s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 32s | Avg:  5m 32s | Max:  5m 32s
      🟩 GCC13              Pass: 100%/1   | Total:  6m 16s | Avg:  6m 16s | Max:  6m 16s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/3   | Total: 14m 54s | Avg:  4m 58s | Max:  5m 31s
      🟩 GCC                Pass: 100%/3   | Total: 15m 32s | Avg:  5m 10s | Max:  6m 16s
    🟩 gpu
      🟩 v100               Pass: 100%/6   | Total: 30m 26s | Avg:  5m 04s | Max:  6m 16s
    🟩 jobs
      🟩 Infra              Pass: 100%/6   | Total: 30m 26s | Avg:  5m 04s | Max:  6m 16s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 54s | Avg: 4m 57s | Max: 7m 28s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 28s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 28s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 28s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 28s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 28s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 28s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 28s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 26s | Avg:  2m 26s | Max:  2m 26s
      🟩 Test               Pass: 100%/1   | Total:  7m 28s | Avg:  7m 28s | Max:  7m 28s
    
  • 🟩 python: Pass: 100%/1 | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 14m 52s | Avg: 14m 52s | Max: 14m 52s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 402)

# Runner
327 linux-amd64-cpu16
32 linux-amd64-gpu-v100-latest-1
28 linux-arm64-cpu16
15 windows-amd64-cpu16

@bernhardmgruber bernhardmgruber merged commit 642976c into NVIDIA:main Dec 3, 2024
418 checks passed
@bernhardmgruber bernhardmgruber deleted the sol branch December 3, 2024 23:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

2 participants